Group Delay Function from All-Pole Models for Musical Instrument Recognition

Authors

  • Aleksandr Diment
  • Padmanabhan Rajan
  • Toni Heittola
  • Tuomas Virtanen
Abstract

In this work, a feature based on the group delay function from all-pole models (APGD) is proposed for pitched musical instrument recognition. Conventionally, spectrum-related features take into account only the magnitude information, whereas the phase is often overlooked because it is difficult to interpret. However, the phase often carries additional information that could be beneficial for recognition. The APGD is an elegant approach to inferring phase information that avoids the issues related to interpreting the phase and does not require extensive parameter adjustment. Having shown applicability to speech-related problems, it is now explored for instrument recognition. The evaluation is performed with various instrument sets and shows noteworthy absolute accuracy gains of up to 7% compared to the baseline mel-frequency cepstral coefficients (MFCCs) case. Combined with the MFCCs and with feature selection, APGD outperforms the baseline on all the evaluated sets.
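As a rough illustration of the idea in the abstract, the sketch below fits an all-pole model to one audio frame and evaluates the group delay of the resulting filter 1/A(z). It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `lpc_autocorr` and `all_pole_group_delay`, the Hamming window, the model order, the FFT size, and the small regularisation term are all hypothetical choices, and any post-processing the paper applies to turn the group delay function into a feature vector is not shown.

```python
import numpy as np


def lpc_autocorr(frame, order):
    """Fit an all-pole model with the autocorrelation method.

    Returns a = [1, a1, ..., ap], the coefficients of A(z) in the
    model H(z) = 1 / A(z). Window and order are illustrative choices.
    """
    x = frame * np.hamming(len(frame))
    # Autocorrelation at lags 0..order (zero lag sits at index len(x) - 1).
    r = np.correlate(x, x, mode="full")[len(x) - 1 : len(x) + order]
    # Toeplitz normal equations R a = -r[1:].
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)  # assumption: tiny ridge for stability
    a = np.linalg.solve(R, -r[1 : order + 1])
    return np.concatenate(([1.0], a))


def all_pole_group_delay(a, n_fft=512):
    """Group delay (in samples) of H = 1/A on n_fft // 2 + 1 frequency bins.

    With B = FFT(n * a[n]), the group delay of A is Re(B / A); the
    group delay of 1/A is its negative.
    """
    n = np.arange(len(a))
    A = np.fft.rfft(a, n_fft)
    B = np.fft.rfft(n * a, n_fft)
    return -np.real(B / A)
```

Because the phase is obtained from the fitted model rather than from the raw short-time spectrum, no phase unwrapping is needed in this sketch. A silent frame (zero energy) would need separate handling before calling `lpc_autocorr`.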

Similar resources

Using group delay functions from all-pole models for speaker recognition

Popular features for speech processing, such as mel-frequency cepstral coefficients (MFCCs), are derived from the short-term magnitude spectrum, whereas the phase spectrum remains unused. While the common argument to use only the magnitude spectrum is that the human ear is phase-deaf, phase-based features have remained less explored due to additional signal processing difficulties they introduc...

Modified Group Delay Feature for Musical Instrument Recognition

In this work, the modified group delay feature (MODGDF) is proposed for pitched musical instrument recognition. Conventionally, the spectrum-related features used in instrument recognition take into account merely the magnitude information, whereas the phase is often overlooked due to the complications related to its interpretation. However, there is often additional information concealed in th...

Classification of Musical Instrument Sounds Using Neural Networks

This study introduces the classification of musical instrument sounds by artificial neural networks (ANN). The time-varying spectral contents of sounds are estimated based on the Short-time Fourier Transform (STFT) and are applied to ANN structures for classification. Recognition results obtained from a multilayer perceptron (MLP), time delay neural network (TDNN) and a hybrid self organizing map radi...

Efficient analysis/synthesis of percussion musical instrument sounds using an all-pole model

It is well known that an impulse-excited, all-pole filter is capable of representing many physical phenomena, including the oscillatory modes of percussion musical instruments like woodblocks, xylophones, or chimes. In contrast to the more common application of all-pole models to speech, however, practical problems arise in music synthesis due to the location of poles very close to the unit circl...

Robust Design of Very High-Order Allpass Dispersion Filters

A nonparametric allpass filter design method is presented for matching a desired group delay as a function of frequency. The technique is useful in physical modeling synthesis of musical instruments and emulation of audio effects devices exhibiting dispersive wave propagation. While current group delay filter design methods suffer from numerical difficulties except at low filter orders, the tec...

Publication date: 2013